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Abstract 

We present in this short note a polynomial graph extension procedure that can 
be used to improve any graph isomorphism algorithm. This construction propagates 
new constraints from the isomorphism constraints of the input graphs (denoted by 
G(V, E) and G'(V, E')). Thus, information from the edge structures of G and G' is 
"hashed" into the weighted edges of the extended graphs. A bijective mapping is an 
isomorphism of the initial graphs if and only if it is an isomorphism of the extended 
graphs. As such, the construction enables the identification of pair of vertices % G V 
and %' G V that can not be mapped by any isomorphism h* : V — > V (e.g. if the 
extended edges of i and i! are different). A forbidding matrix F, that encodes 
all pairs of incompatible mappings (i is constructed in order to be used by 
a different algorithm. Moreover, tests on numerous graph classes show that the 
matrix F might leave only one compatible element for each i G V. 

1 Introduction and Notations 

In theoretical computer science, GI is one of the only NP problems that is not known to be 
either in P or NP — P (we assume P ^ NP) and a lot of effort has been done to classify 
it. Proofs of polynomial time algorithms are available for many graph classes [1,3,4], 
but, however, all existing algorithms are still exponential for some well-known families 
of difficult graphs, e.g. regular graph isomorphism is Gl-complete [2,5] (if regular graphs 
can be tested for isomorphism in polynomial time, then so can be any two graphs). 

We denote the adjacency matrices of G and G' by M and M'. The number of vertices 
(denoted by \G\ or \V\) is commonly referred to as the graph order. A mapping between 
G and G' is represented by a bijective function on the vertex set h : V — > V. We say 
that h* is an isomorphism if and only if G E (h* (i) , h* (J)) G E' and the graph 
isomorphism ( GI) problem is to decide whether or not such an isomorphism exists. 

A critical problem of all tested algorithms appears in the following situation: if there 
is no edge between vertex i and j in G (i.e. = 0) and no edge between h(i) and 

h(j) in G', than the assignment — > (h(i), h(j)) is not seen as a conflict — there is no 
mechanism to directly detect whether and (h(i), h(J)) are indeed compatible or not. 
But, by exploiting the structure of the graph, one can find many conditions in which (i, j) 
and (h(i), h(j)) are incompatible even if they are both disconnected (e.g. by checking the 
shortest path between them). 
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2 Polynomial graph extension 



We define the |V| x \V\ matrix N a , in which the element iV°- is the number of paths of 
length a (i.e. with a edges) from i to j. Obviously A^ 1 = M, and we now show that N a+1 
can be computed in polynomial time from M and N a using the following algorithm: 

Algorithm 1 Graph extension in polynomial time 
Input : M and N a 
Result : N a+1 

1. Set all elements of N a+1 to 

2 . For i = l to \V\ 

For j = i + l to \V\ 
If M[i,j] = 1 then 
For k = 1 to \V\ 

• N a+1 [i, k] = N a+1 [i, k] + N a [j, k] 

• N a+1 \j, k] = N a+1 [j, k] + N a [i, k] 

• N a+1 [k,j} = N a+1 [j,k] and N a+1 [k,i] = N a+1 [i,k] 



The extended graph is straightforwardly defined as the weighted graph with vertex 
set V and weighted edges E a such that if N£ ^ 0, then N?A e E. Two graphs 
G and G" are isomorphic if and only if their extended graphs are isomorphic — because 
the same extending operations are applied in the same manner for any two isomorphic 
vertices i and h*(i). 

An important advantage of constructing all matrices A^ 1 , A^ 2 , . . . N a is the early de- 
tection of incompatible (forbidden) assignments, i.e. vertices (i G V,i' G V) that can 
never be mapped by an isomorphism. 

Definition 1 (Compatible assignment) Vertices i G V and i' G V are compatible if and 
only if: (i) = Nfy and (ii) all the values from line i of N a can be found in line i! of 
N' a and vice versa. 

Indeed, if h* is an isomorphism, then all assignment (i — > h*(i)) are compatible; each 
element of line i of A^", can also be found at position (h*(i),h*(j)) in line h*(j) 

of N' a . Therefore, any GI algorithm should never map two incompatible (forbidden) 
vertices. We introduce a matrix F encoding forbidden mappings, i.e. if F^ — 1, i is 
never mapped to i' . This matrix is empty at start (all elements are 0), and the extension 
algorithm gradually fills its elements while constructing the matrices A^ 1 , A^ 2 , . . . N a . 

The matrices A^ 1 , A^ 2 , A^ 3 , . . . are very rich in information that is implicitly checked 
via the matrix F. Each edge value from the extended graph is in fact a a hash function of 
some larger structures in the initial graph. Indeed, the fact that an assignment i — > i' is 
not forbidden (i.e. Fa> = 0) implies numerous hidden conditions: i and i! need to have the 
same degree (otherwise Nf i ^ N-? if ), they need to be part in the same number of triangles 
(otherwise, NfJ. = N^,), they need to have the same number of 2-step neighbors, etc. 
Many other such theoretical conditions can be derived and proved, but the goal of this 
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specific paper is only to present a very practical, high-speed algorithm; such theoretical 
conditions are investigated in greater detail in a completely different theoretical study. 

Finally, we note that our practical C++ implementation uses unsigned long integer 
variables encoded on 64 bits. However, for large values of a, iV"- can exceed 2 32 — 
1; therefore, we consider all addition operations Modulo 2 32 (in our C++ version, the 
variables are encoded so that 2 32 — 1 + 1 = 0). This observation does not change the 
fact that f a (h*) = when h* is an isomorphism, because if N"j = N'^ h ^, then = 
N'h{i)h{j) {Modulo 2 32 ). However, it is still theoretically possible to have the Modulo 
equality without the non-Modulo equality. 

3 Conclusion 

We implemented several algorithms, both exact and heuristics using this property (espe- 
cially the matrix F). Generally speaking, such an algorithm consists in two stages: (i) the 
graph extension (ii)the effective algorithm that can be quite naive. The first stage builds 
the information-rich adjacency matrix N a and it also provides a matrix F of forbidden 
vertex assignments. 

Numerous tests of such an algorithm with very large graphs show worst-case behavior 
of polynomial time. Only the strongly regular graphs can show more difficulties, but 
we tested only several strongly regular graphs with up to 275 vertices and the behavior 
seems similar. Larger strongly regular graphs are not classified, and since there is no 
practical algorithm to generate them, we restricted to examples publicly available on the 
Internet — the McLaughlin Graph with 275 vertices. 

The extending procedure provides additional evidence that the graph isomorphism 
can be (at least in practice) solved in polynomial time for almost all graph types we 
know. 
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